# RoBERTa Variants
## efficient_mlm_m0.15-801010
A RoBERTa-based masked language model with pre-layer normalization, trained with a 15% masking ratio and the 80/10/10 token-corruption scheme to study how the masking ratio affects masked language modeling.
- Tags: Large Language Model, Transformers
- Organization: princeton-nlp

## efficient_mlm_m0.40
A RoBERTa-based masked language model with pre-layer normalization, trained with a 40% masking ratio to study how the masking ratio affects model performance.
- Tags: Large Language Model, Transformers
- Organization: princeton-nlp
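Both entries are masked language models, so a standard fill-mask workflow applies. Below is a minimal sketch using the Hugging Face transformers library; the checkpoint id is inferred from the listing above, and its compatibility with the generic `AutoModelForMaskedLM` loader is an assumption, since the pre-layer-norm RoBERTa architecture may require a dedicated model class or the authors' own training fork.

```python
# Minimal fill-mask sketch (assumptions: checkpoint id and that a recent
# transformers release can load this pre-layer-norm RoBERTa checkpoint
# through the generic AutoModelForMaskedLM path).
import torch
from transformers import AutoTokenizer, AutoModelForMaskedLM

model_id = "princeton-nlp/efficient_mlm_m0.40"  # assumed checkpoint id

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForMaskedLM.from_pretrained(model_id)
model.eval()

# Build an input containing a single mask token.
text = f"The capital of France is {tokenizer.mask_token}."
inputs = tokenizer(text, return_tensors="pt")

with torch.no_grad():
    logits = model(**inputs).logits

# Locate the masked position and print the top-5 predicted tokens.
mask_index = (inputs["input_ids"] == tokenizer.mask_token_id).nonzero(as_tuple=True)[1]
top_ids = logits[0, mask_index].topk(5, dim=-1).indices[0]
print(tokenizer.convert_ids_to_tokens(top_ids))
```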